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formalism have dispersion relations in which propagation planes factor into null planes cor¬ 
responding to two effective metrics which depend on the point-wise values of the electromag¬ 
netic field. These effective Lorentzian metrics share the null (generically two) directions of 
the electromagnetic field. We show that, the theory is symmetric hyperbolic if and only if 
the cones these metrics give rise to have a non-empty intersection. Namely that there exist 
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is well-posed. We illustrate the power of this approach with several nonlinear models of 
physical interest such as Born-Infeld, Gauss-Bonnet and Euler-Heisenberg. 


* jferal8@gmail.com 

t fedecarrasco@gmail.com 

* egoulart2@gmail.com 

§ reula@famaf.unc.edu.ar 



2 


I. INTRODUCTION 

Nonlinear electrodynamics (NLED) is relevant in several areas of physics. In QED, the polar¬ 
ization of the vacuum leads naturally to nonlinear effects (such as the light-light scattering) which 
are effectively described by Euler-Heisenberg’s Lagrangian pi, m , ®, 0 , m, 0 (see also [Sj for 
a pedagogical review). In some dielectrics and crystals, the interaction between the molecules and 
external electromagnetic fields can be described by an effective nonlinear theory, which is typically 
observed at very high light intensities such as those provided by pulsed lasers 0 , m • Possible 
consequences of NLED have been explored also in cosmology and astrophysics. In particular, it 
is believed that nonlinearities may play important roles in the description of the dark sector of 
the universe na, m, m, usi, m, in the avoidance of singularities (when coupled to Ein¬ 
stein’s equations) or in the physics of charged black holes, pTTJ , jN.jl!)]. [20] . [21], [22] . These 
models have the advantage of using only electromagnetic fields, without invoking yet unobservable 
scalars or more speculative ideas related to higher-dimensions and brane worlds. Finally, Born- 
Infeld nonlinear model [231425] has mathematical connections to string theory, for its Lagrangian 
appears in relation with the gauge fields on a D-brane (see, for instance, [26]). In general, NLED 
theories attracts attention because they offer insights into light propagation in the experimental 
and theoretical studies of relativity. 

A key feature to elucidate about the partial differential equations (PDEs) governing NLED 
is whether they pose an initial value formulation. Well-posedness is at the roots of physics, for 
it amounts to the predictability power of the theory, asserting that solutions exist, are unique 
and depend continuously on the initial data. The mathematical theory dealing with the initial 
value formulation is well developed and for this case it amounts to check whether the first order 
system of quasilinear PDEs is hyperbolic and what are the maximal propagation speeds. Roughly, 
hyperbolicity is an algebraic property of the principal symbol (the differential operator consisting 
of the highest derivative order terms of the PDE), which is essential to prove local well-posedness of 
the non-characteristic Cauchy problem. In theories like NLED the analysis of hyperbolicity needs, 
however, a careful manipulation. Because constraints are present, the evolution equations are not 
uniquely defined for one can add to any one of them constraint terms and obtain another equivalent 
evolution system. The distinction of an evolution system over the others generally implies a choice 
of 3 + 1 decomposition of space-time and, as such, breaks covariance. 

In this paper we shall treat the equivalent evolution systems in an equal footing so as to assert 
the desired hyperbolicity properties keeping covariance as much as we can. The tools to deal with 
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hyperbolicity in this way has been provided by Geroch in p]J. However, as there are many different 
notions of hyperbolicity available in the literature (see, e.g. m, [zs], m), we now recall some 
previous results concerning the evolutionary aspects of NLED. In m, for instance, Brenier uses 
the energy density and the Poynting vector as additional unknowns to augment the original 6x6 
Born-Infeld system to a system of 10 x 10 hyperbolic conservation laws (for which well known 
results can be directly applied [31]) [32]) [33] ). A similar analysis using convex entropies (actually 
the energy) may be found in [34] . where Serres extends a method designed by C. Dafermos [35] 
(see also |36j) to the class of models described by Coleman & Dill m- As a result, he shows that 
the polyconvexity of the energy density implies the local well-posedness of the Cauchy problem 
within smooth functions of class H s with s > l + d/2. Another approach using a 3 + 1 splitting was 
provided by V. Perlick in [38], assuming that the constitutive equations Fa(H, F) = 0 can be solved 
for E and H. Several results that are relevant for the question of whether the evolution equations 
are hyperbolic, strongly hyperbolic or symmetric hyperbolic are investigated in details. It is worth 
mentioning also the global results by J. Speck [39]. Using ideas presented by Christodoulou and 
Klainerman m, m he establishes the existence of small-data global solutions to the Born-Infeld 
system on the Minkowski space background in 1 + 3 dimensions. Roughly, he concludes that 
if the initial data for the Born-Infeld equations are sufficiently small as measured by a weighted 
Sobolev norm, then these data launch a unique classical solution to the equations existing in all of 
Minkowski space. Furthermore, he shows that these solutions decay at exactly the same rates as 
solutions to the linear Maxwell system. 

Here, we adopt a different strategy. Working within Geroch’s geometrical formalism we find 
the most general hyperbolizations NLED theories admit and show that they’re parametrized by an 
auxiliary vector field t q (x) (as is also the case for linear electrodynamics). This construction allows 
us to find necessary and sufficient conditions theories and fields must satisfy in order to have a well- 
posed initial value formulation. It happens, such condition translates very nicely into geometrical 
terms: the system is symmetric hyperbolic, if and only if, the two cones arising from the dispersion 
relations (or, conversely, the characteristic surfaces) of the given NLED theory have a non-empty 
intersection. This constitutes one of the main results of the present article. Our construction 
also allows to find the causality cone (the maximal propagation speeds) each hyperbolization have. 
These cones could be, in principle, different from the physical cones as defined by the dispersion 
relations, for the latter ones use all equations, including the constraints, to assert the propagation 
velocities of plane waves, while the hyperbolization cones use only the corresponding evolution 
equations. We find that, nevertheless, they coincide. 
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A warning on terminology: In the present context, cones appear in several related disguises: 
First, we have the familiar cones arising in Lorentzian geometry, namely those arising from the set 
of all time-like vectors {v G T p M| g a bV a v b > 0}, which splits into two disjoint sets, the “future” and 
“past” propagation cones. We shall reffer to them as the cones of a given metric; Second, now allow¬ 
ing for more general symmetric-hyperbolic systems, the cones that appear as the set of co-vectors 
which make positive definite a certain symmetric hyperbolizer, C* H := {n a G T*M| //“ g n a > 0}. 
These cones are clearly open and convex, for if n a and n! a belong to C* H so does Ain a + A 2 n! a for 
all positive Ai, A 2 , since the sum of positive bi-linear forms gives another positive bi-linear form. 
They represent planes on the tangent space. Once a plane is found within C the others are found 
tilting it until H^n a gets a kernel. Each one of these planes represents a plane wave perturbation 
solution to the underlying equations out of which the symmetrizer was built upon; Third, we can 
construct the co-cones, duals to the previous ones, that is, given a cone, C G V we can define a 
cone in the dual space, C' := {a G V'\ a(v) > 0 , Vu G C}, this is clearly also a convex cone. 
For the cones defined from hyperbolizers these cones are called the propagation cones , for they 
determine the directions along which perturbations propagate. Notice that the co-cones of a given 
metric cone are precisely those covectors obtained by lowering with the metric the indices of the 
all the vectors forming the cones, so the distinction is lost. However, since in this work there ap¬ 
pear several Lorentzian metrics it is best to keep cones and co-cones as separate geometrical entities. 

This article is organized as follows: In Section II, we introduce NLED equations and point 
out some of their basic structural properties. We then review some important results concerning 
characteristic surfaces and deepen on the geometrical aspects these theories exhibit; In section III, 
an intrinsic geometrical formulation of PDEs due to Geroch is introduced and the notion of sym¬ 
metric hyperbolicity is presented. We then investigate the algebraic core of the NLED equations 
and find the most general hyperbolizations these theories allow. Our main results are presented 
in this section in the form of two theorems, together with (we hope) comprehensive descriptions 
of the key ideas behind the proofs; In section IV, we provide the reader with the detailed steps 
to prove the theorems and thus while Section V deals with the constraints; Finally, Section VI 
explores our results for some particular realizations of the Lagrangian. 
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II. GENERAL REMARKS 

A. Lagrangians and equations of motion 

Write (M, g ) for a (l+3)-dimensional space-time, with signature convention (+, —, —, —), and 
F ab for the electromagnetic 2-form. Let 

F := F ab F ab = 2(H 2 - E 2 ) G := F ab F ab = AE.H (1) 

* 

denote the field invariants, where F ab = \rj a bcdF cd stands for the dual, r] abcc i the completely anti¬ 
symmetric Levi-Civita tensor and (E, Ft) the electromagnetic fields. We consider nonlinear models 
of electrodynamics in vacuum provided by the action, 

S = J C(F,G)V=g d 4 x, (2) 

where the Lagrangian density C{F,G ) is an arbitrary smooth function of the invariants and g := 
det(g ab ). The first-order, quasi-linear, equations read as 

V a [c F F ab + C G F ab ^j = 0, V [ a F bc } = 0, (3) 

with V denoting covariant derivative and C\ '■= §§f, for conciseness. Here, the l.h.s system is 
obtained via the variational principle while the r.h.s system is assumed from the very beginning, 
thus guaranteeing the existence of a four-potential such that F ab = V[ a A b j. We define also, for 
future convenience, the quantities: 

6 : = 2 Cff/C-f, £2 := 2 Cfg/C-f, £3 := 2 Cgg/C-f- (4) 

Remark 1. In Maxwell’s electrodynamics, there exists a particular gauge which considerably sim¬ 
plifies the second order equations for A a (x). Unfortunately, this is not the case for more general 
nonlinear theories and it will be convenient to focus our analysis on the first order equations ([3]) 
only. Also, currents j a (x ) are irrelevant for the question of whether the initial value problem is 
well-posed. Therefore, they ’ll play no role in our further discussion. 

B. Dispersion relations, characteristics and effective metrics 

It is well known that high-frequency perturbations about a smooth background solution of Eqs. 
Q are controlled by two effective metric^] The core of this result was first presented by Boillat 


1 See [42] and 03] for a wealth of details in the context of analogue models of gravity. 
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pH ] and Plebanski [43J in the early 70’s using Hadamard’s method of discontinuities [46], More 
recently, Obukhov and Rubilar m noticed that quasi-linear PDE’s of the form ([3]) are particular 
instances of electrodynamics inside media described by general nonlinear constitutive laws. As a 
consequence, they showed that if X is a characteristic hypersurface described by f(x a ) = const, the 
wave normals k m := d m f are given by the vanishing sets of a fourth-order multivariate polynomial 
in the cotangent bundle T*M 

V\x, k ) := <S abcd (x ) k a k b k c k d = 0. (5) 

Here, (3 abcd (x ) is a completely symmetric quantity (35 independent components) depending im¬ 
plicitly on the background solution. Physically, ([5]) plays the role of a dispersion relation for the 
linearized waves and gives rise to some sort of covariant Fresnel equation [3SP [Ej- 

For the class of nonlinear Lagrangian models provided by Q a remarkable property holds due 
to algebraic conditions: the multivariate polynomial ([5]) always reduces to the simpler form 


V* (x, k) = ak A + Qk 2 ! 2 + Rl 4 

with k 2 = g ab k a k b , l 2 = F a c F bc k a k b , for conciseness, and 

(6) 

a: = {l + Z 2 G-Z 3 F-RG 2 /16), 

(7) 

Q : = 2(6+6 -RF/4), 

(8) 

R ■ = 4 (66 - tl) • 

(9) 


A closer inspection of ([6]) reveals that the quartic Fresnel surface of the wave normals factorizes to 
the product of two second order surfaces, given in terms of the quadratic forms 


Qi b {x)k a k b = 0 , Q2 b (x)k a k b = 0 , ( 10 ) 

with the reciprocal effective metrics given by 

flf : = ag ab + h^F*, (H) 

Qf: = g ab + (b 2 /o)F a c F bc , (12) 

according to the following definitions [^] 


h := 


Q + V A 


b 2 : = 


Q-V A 


A := Q 2 - 4 aR. 


(13) 


If a —> 0 but bi 0 the left hand side of (12 1 is still finite since 62 /a = R/bi. When a —> 0 and 61 —» 0 
simultaneously we can, using the conformal freedom, redefine the metrics as gj 6 := (l/6i)gi i 02^ := 61 gf 1 . The 
results we shall obtain do not depend on this reparametrization. 
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As stressed by Boillat [35], the roots always exist since the discriminant is actually a sum of squares, 
i.e. A = 4(iVf + iV|) with 

Ni ■= (£1 - 6) - RF/ 4, N 2 := 2^ 2 - RG/ 4. (14) 

Obukhov and Rubilar proceed by showing us that g“ fc and Q? 2 b are Lorentzian whenever the 
background spacetime metric is Lorentzian. Their results are in qualitative agreement with previous 
results by Novello et al [50], m wherein the rays spanning the characteristic surfaces are often 
described in terms of effective null geodesics. Therefore, in what follows we shall assume that the 
effective metrics are always LorentziarJ^J For the covariant components of the latter we write g^ fe 
and Q 2 ab . They’re such that dTslb = ^"b and 02 c 0cfe = 

C. Geometrical Structure: the cones 

The effective metrics are defined up to a conformal transformation. However, there are on them 
an intrinsically geometric property which does not depend on conformal redefinitions, namely their 
cones [^J We will denote by C Si , the set of all timelike (future-directed) [^] vectors with respect 
to the metrics g,, and C*. := int ({n a e T^Mj n a v a > 0 , \/v a G C7 0i }) [®j the dual of these cones, 
or co-cones. We study here some geometrical relations among these cones, since they’ll play an 
important role in the description of hyperbolicity. In order to do that, we distinguish between 
two different cases, according to the nature of the electromagnetic field F ab : i. non-degenerate 
fields (F 2 + G 2 / 0); ii. degenerate fields (F = G = 0). As these cases have different geometric 
interpretations, we shall treat them separately (see also [52]). 

1. Non-degenerate F ab 

A non-degenerate 2-form at a point p, has (at that point) two null eigenvectors k a and l a 
(see appendix A for further details). The directions of these vectors are called the principal null 
directions (PND’s) [534455] and will play a key role in our description. 

3 A degenerate metric does not give rise to a cone according to our definition. Therefore, we analyze them separately 

in appendix B, where we conclude those systems are not symmetric hyperbolic. 

4 By a cone we mean the interior of a proper cone. Recall that a proper cone C is a subset of a vector space V such 

that: (au + / 3 v) € V Vu,v6t V a, /3 > 0 and C H —C = 0 . 

5 For symmetric hyperbolic systems once a symmetrizer is given we will adopt for all effective metrics the convention 

that the future cones are those with a non-vanishing intersection with the corresponding propagation cone. In the 
case the system is not symmetric hyperbolic we shall take into account all cones, two for each metric, and refer to 
all of them as cones. 

6 We choose the interior as to make the dual cones open, and thus, put them in an equal footing with the cones C Bi . 
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Remark 2. k a and l a are null with respect to the three metrics, that is, the background and the 
effective metrics. We shall say henceforth that the closures of all cones share these principal null 
directions. The plane spanned by k a and l a form the essential geometrical structure of the system. 


The PND’s help us to construct a frame which simplifies calculations considerably. In this 
frame, E || H, g ab reduces to rj ab and one obtains the quadratic forms 

Q l ab t a t b := af 1 (t§ - tf) - pf 1 (t\ + tl) gfn a n b := a* (rag - n§) - A (ra? + rag) (15) 


with i = l ,2 and 


ai :=a + b i± (f - s/F 2 + G 2 ) A :=a + b i| (f + VF 2 + G 2 ) 

a 2 :=l + %\ (f-VF 2 + C^ A := l + %\ (f + VF 2 + G 2 ) 


(16) 


Note that the quantities involved in (16) are covariantly defined, i.e. they are functions of the 
invariants F and G. 



(a) Pi > 0 and P 2 > 0 


(b) Pi > 0 and P 2 < 0 


(c) Pi < 0 and P 2 < 0 


FIG. 1: Non-degenerate case. Possible configurations for the null surfaces of the metrics: 
0 ^ (red); g^ b (blue); background metric (meshed gray). 


A closer inspection of (15) reveals that the effective cones are characterized by the signs of the 


coefficients (fl 6 |). We introduce to this matter, two useful quantities 


fli := otifii (for i = l,2) (17) 

and present a table illustrating all possible sign combinations, 



sign (a;) 

sign (hi) 

signature of g, 

timelike vector 

g\t a t b 

v ab 

sign(fli) 

0 

+ 

+ 

(+> ~> — i — ) 

t a = ( 1 , 0 , 0 , 0 ) 

aj -1 > 0 

+ 

a) 

- 

- 

(—> +1 +) +) 

t a = ( 1 , 0 , 0 , 0 ) 

aj " 1 < 0 

+ 

in) 

+ 

- 

(+> +> +> ~) 

1 ° = ( 0 , 0 , 0 , 1 ) 

-ai " 1 < 0 

- 

iv) 

- 

+ 

(~> —3 —1 +) 

t a = ( 0 , 0 , 0 , 1 ) 

—a ]" 1 > 0 

- 
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Note that the signatures of the effective metrics are not determined by their Lorentzian character. 
In particular, the norm of timelike vectors depends on the background field and does not have a 
preferred sign Q The cones, in the other hand, are clearly independent on how these signatures 
turn out; and the same will apply to all our future results, as we shall see later. 


Proposition 1. 

(i) If lit > 0, then C 0i flC^ / 0. (where rj ab := Diag (1, —1, —1, — 1)). 

(ii) If Hi < 0, then C Bi n C a / 0. (where a ab := Diag (— 1 , — 1 , — 1 , 1 )). 


Therefore, there are only three qualitatively different configurations for the effective cones in T v M: 
either the three cones intersect (Fig|la[); the two effective cones intersect each other, but do not 
intersect the background cone (Figjlx]); or they don’t intersect (Fig |lb[ ). Moreover, when the 
effective cones do intersect each other, it turns out that is always possible to single one as being 
included inside the other. Thus, the following holds 


Proposition 2. Whenever the effective cones intersect each other, i.e: C Bl n C S2 f 0, then 

M C* S1 c C 0 * 2 

(ii) C S2 C C B1 

Proof. We begin our proof with the important observation 

Lemma 1. The following inequalities appear as algebraic consequences of the definitions, 


a i/3 2 < 1 < a 2 Pi 


(18) 


Indeed, starting from the full expressions, 

ai/?2 = 1 + \ ((FNi + GN 2 ) - yjN* + N$VF 2 + Cp'j (19) 

a 2 Pi = 1 + \ {[FN x + GN 2 ) + yjNl + 1V 2 VP 2 + G 2 ^j 

and the trivial inequality a 2 /3i > oti(3 2 one obtains, after some simple manipulations, 

- (1 - «i/3 2 ) (1 - a 2 Pi) = ^ (GN i - FN 2 ) 2 > 0. (20) 

The positivity of the left hand side, together with condition a 2 Pi > a\/3 2 , forces the inequalities 
of the lemma to hold. 


7 One could in principle redefine the metrics to fix both signatures in accordance with the convention chosen for the 
background metric, but these redefinitions are going to depend generically on the background fields. 
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We now define two auxiliary quantities which capture the notion of “how much the given cone 
(or co-cone) opens in any direction orthogonal to the PND’s plane” in this particular frame. 


7i : = VWJMi 


( 21 ) 


In spite of the fact that this is a coordinate-dependent notion it allows us to compare the cones 
and tell which of them is included in the other. The latter statement has a well defined geomet¬ 
rical meaning and directly extrapolates to any other frame. The argument goes as follows. The 
assumption C gi n C S2 / 0 translates into 


signal) = sign(Q 2 ) or equivalently, 0 < = aifactiPi 


( 22 ) 


which, together with (18), implies aifc > 0 and, consequently, 71 < 72 . Now, suppose fR > 0 and 


O 2 > 0 and consider a continuous transition from a time-like to a space-like covector (w.r.t. the 
effective metrics) parametrized by x E [0,1] in the form 


n a (x) = (1 — x) (1,0,0,0) T x (0, cos 0, sin cf> 1 0) 


(23) 


with cj) E [0,27r). The idea is to find x\ and X 2 such that the covector becomes null, i.e: 
gf b n a (xi)nb(xi ) = 0 (i=l,2). The solutions are given by 


Xi = 


1 + 7 


-1 


(24) 


which determines x\ < X 2 - Conversely, assuming fR < 0 and O 2 < 0 we consider the family 


n a (x) = (1 — x) ( 0 , 0 , 0 , 1 ) + x ( 0 , cos (f>, sin 0 , 0 ) 

and get precisely the same answer i.e. x\ < X 2 - This allows us to conclude that C* { C C* 2 as 
claimed in (i). In order to prove (ii), we basically apply the same strategy. Instead of using the 
reciprocal effective metrics, we now look at the covariant objects g’ a/j and the relevant quantities 
become 7 * := jf 1 . One then concludes that our previous inequality is inverted i.e. x\ > X 2 - The 
latter directly leads us to the desired result (ii). 

□ 


2. Degenerate F a b 

For a degenerate 2-form, the two null directions collapse into a single one (see figure [2|. Roughly, 
this means that, either the cones do not intersect each other, or one of them is included in the 
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othei|^} Again, we refer the reader to appendix A for more details on frames. In particular, one 
can reduce the metrics and inverses to, 

= (*o “ *§) + ^ h % (*o - hf ~ (it\ + tf) 

(25) 

0 i b n a n b = (pi - nfj - e\ (n 0 + n 3 ) 2 - (n\ + nfj 
with s/Oa free parameter, a remaining freedom in our frame choice that we’ll fix in a convenient 
way later on. Note when bi = 0 the effective metric reduce to background metric. 


Proposition 3. For a degenerate F a b, the effective cones always intersect, i.e: FC* 2 f 0, and 

with background metric n C* 2 n C* / 0 . Furthermore, 

V Cl c q 2 

(a) Cg 2 c c ai 

Proof. First, we want to see whether there is a non-empty intersection among the effective co¬ 
cones. The idea is to choose the parameter e “sufficiently small”, so as to construct an explicit 
timelike covector w.r.t. both reciprocal effective metrics 

Lemma 2. //e 4 < min^i^ (|6j| _1 ), there exists a covector which is time-like with respect to both 
reciprocal effective metrics. 


Proof. Let us consider, in the present frame, four linearly independent covectors: 

Ta = (1,0,0,0); x a = (0,1,0,0); y a = (0,0,1,0); z a = (0,0,0,1). One obtains, 

di b r a T b = (l - £%) > 0 Qi b x a x b = -1 

Qi b Vay b = -1 8i b z a z b = - (l + £%) < 0 

Therefore, there exists a common time-like covector r a , and thus 6 ^ n C* 2 O C* f 0. □ 

To prove (i), we proceed in the same lines as in the non-degenerate case. Firstly, from the 
definitions of b\ and 62 there follows 62 < b\. We then consider a continuous transformation from 
a timelike to a space-like covector, parametrized by x 6 [ 0 , 1 ] in the form: 

n a (x) = (1 - x) r a + xz a (26) 


with r a and z a as defined above. Again, we look for x\ and X 2 within the range [0,1], for which 
the covector becomes null w. r. t. both metrics, i.e: gf b n a (xi)nb(xi) = 0 (for * = 1,2). Then, 


Xi = 



(27) 


this do not exclude the possibility they coincide. 
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which implies x\ < ; Consequently, C* x C C* 0 |^J We remark that (ii) can be easily obtained in 

the same lines. 



(a) 0 < 62 < &i (b) 62 < 0 < b\ (c) 62 < bi < 0 

FIG. 2: Degenerate case. Possible configurations for the null surfaces of the metrics: 
g) 6 (red); (blue); background metric (meshed gray). 


III. HYPERBOLIZATIONS 


In order to analyze the evolutionary properties of the system © more closely, we shall recast 
it in the geometrical framework suggested by Geroch [lj: 


K A m a (x, <L)d m $“ + J A (x, $) = 0, 


(28) 


where K A m (x, <b) is called the principal part and J j 4 (x, < F) stands for senri-linear contributions 
(whose explicit form is unnecessary for our discussion here). In this expression capital Latin 
indices, A, stand for the space of tensorial equations, lower Latin indices, m, for space-time indices 
and Greek indices, a, for multi-tensorial unknowns. 


Remark 3. Typically, smooth solutions of (28) are interpreted as cross-sections <h“(x) over a 
smooth fibre bundle B, with points k, = ( x a , <L“) and we interpret the fibre over x a as the space of 
allowed physical states at x a , i.e., as the space of possible field-values at that point. 


Notice that, in order to conclude this, it is enough to start from the common time-like covector and “move” in any 
direction,since we already know there is one (and just one!) common null direction among the effective metrics. 
Thus, we argue it is not possible to find different results in different directions. As observed in the beginning of 
this section, once they have non-empty intersection, one of the cones must be included on the other. 






















13 


Definition 1. By a hyperbolization of (28) over a sub-manifold S E M, we mean a smooth 


symmetrizer h A a such that: 


1. the field h A a KJ( n p is symmetric in a, /3 in S; 

2. there exists a covector n m E T* S such that h A a Kis positive-definite. 

If a system of first-order PDEs admits a symmetrizer satisfying the above conditions, we say that 

it is symmetric hyperbolic. 


Once a hyperbolizer exists, standard theorems apply and we know that given any smooth data in 
a hypesurface such that n a is normal to it a local solution for it would exists. Notice that since the 
set of co-vectors n a for which the hyperbolizer is positive is open, we can always choose them in a 
neighborhood of a point so that they are surface forming. We introduce also, for later convenience, 
the notion of physical propagation in this context. We denote by C* H the collection of all covectors 
n a satisfying condition (2.) above. Then, Cf is a non-empty open convex cone. 

Definition 2. The “signal-propagation directions” will be given by all tangent vectors p a , such that 
p a n a > 0, V n a E C* H . The set of p a also forms a (non-empty) closed convex cone denoted by Ch, 
the “dual cone ” of C* H . 

Remark 4. These cones could depend, in principle, on the hyperbolization selected. But it turns 
out that, for most physical examples, these cones are essentially independent of hyperbolization. 


For NLED, we declare <h a —y F bc , and a closer inspection of ([3]) allows us to read-off the principal 
symbol 


K m 
1X A a 


9a m bc + Fa m {HiFbc + 6 F bc )+ F a m (f 2 F bc + & F bc ) 


- m 
) la be 


(29) 


with g abc d := \{g a c9bd ~ 9adSbc )• I n what follows we shall see that symmetric hyperbolicity holds 
for NLED under some basic assumptions. Basically, the latter are conditions on the Lagrangian, 
its derivatives and field strengths. 


A. Symmetrizer 


Our first task is to find a symmetrizer for (29). In other words, we look for a h A a such that 

• X ab and <5<l“ 


8& 0 (h A a Kjf n p)8$P is symmetric in and dT. Making the identifications 5^ 
Y ab , with X ab and Y ab arbitrary anti-symmetric tensors, one has 


= ( - 


Y a m + A Y F a m + B Y Ff 


Y ' 

J 1 a 


(30) 
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where 


:= [6(F.Y)+6(F .Y)] 


B y := [&(F.Y)+&(F .Y)] 


(31) 


and X.Y = X ab Y a b. Note that the above relations depend on the background field, the background 
metric and on the particular Lagrangian theory. It is natural to expect that h A a depends also on 
these quantities. Also, when the theory is linear, i.e. A Y = B Y = 0, we know that 


h A J<S> a = X\ , - X a q ) t q 


(32) 


where t q is an auxiliary smooth vector field. Therefore, the full nonlinear symmetrizer must: i) 


reduce to (32) in the linear case and ii) depend on at least one smooth vector field t q (x ). We claim 
that it is given by the formula 


h A J<S>° = ( x° a , - X* -Ax n +5xn t q . 


(33) 


To show that it is indeed a symmetrizer, we first multiply (33) by (30). It follows, 
5<S> a (h A a K A m 8)^' = ( - X a q Y a m - A Y X a q F a m - B Y X% F a m - 


- X* q Y* m -A x F* a q Y* m +B^F a q Y> )t q . 


Recalling that any pair of anti-symmetric tensors satisfies 


X aq Y am = --(X.Y)C + X am Y aq , 


X aq Y am = -(X ,Y)S 1 q — X am Y aq , 


(34) 


we obtain 


5^ a {h A a K^ p )5¥ = (M q m + N q m + L q m )t q , (35) 

with 

M q m = +(X q a Y a m + Y q a X a m ) + \ (X.Y)<5, m , 

N q m = -(A Y A- + A X Y“) F a m - (B Y X a q + B x Y a q ) F>, 

L q m = +{6(F.X)(F.Y) +6[(F -X)(F.Y) + (F .Y)(F.X)] + &(F ,X)(F .Y)}5 q m /2, 

which are symmetric quantities in X and Y. Note that this result is general and does not depend 
on the specific form of the Lagrangian. Note also that the symmetrizer splits into a main term, 
which coincides with Maxwell’s symmetrizer and two nonlinear terms involving the background 
2 -form and its dual. 
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B. Positive definiteness 


We now investigate when the symmetrizer constitutes a hyperbolization of the equations of 
motion. For the sake of conciseness, let us define the symmetric object 

H a p(t,n) = h A a {t)K (36) 

We emphasize here the linear dependence of this object with respect to both the vector t a (in the 
symmetrizer) and the covector n m , a fact that will be important later on. We shall see that the 
admissible range for the quantities (t q ,n m ) has a nice geometrical interpretation in terms of the 
effective metrics. 

According to our previous definition, the system will be symmetric hyperbolic if H a p(t,n) 
constitutes a positive definite bi-linear form, i.e. 

$ a H a p(t,n)<f>P > 0 (37) 


for any non-zero <h Q . Here, <3?" represents an arbitrary two-form, and as such can be thought as 
a vector in M 6 . Therefore, the family of symmetric maps H a p(t,n) : M 6 —> M. One can build a 
natural basis for this space by taking the six possible anti-symmetrized pairs of basis-elements in 


T*M. 


From (35) we get 


H a /3(t, n) = H a p(t, n)\ M + H q/ 3 (t,n)\ N + H a/3 (t,n)\ L 


(38) 


A tedious but straightforward calculation yields 

Hap(t, fl) ^ 9abq[c^ rf] T 9cdq[a^ ft] T 9abcd 3 q ^Fn m , 

n)\ N = ((£1 F ab + £ 2 Fab )g q[ cF d] m + (£iF cd + £ 2 F cd )g q { a F b ™ + 

+ (&F a b + 6 F ab )g q [ c F d ™ + (£ 2 F cd + £3 F cd )g q [ a F b ™ ) t q n, 
H Q p(t, n)\ L = - (iiF ab F cd + £ 2 ( F ab F cd + F ab F cd ) + £3 F ab F cd j (t.n). 


We now address the following: What are the conditions on the particular NLED theory (and 


fields) in order to have at least a pair ( t a ,n a ) satisfying the positivity requirement (37). Among 


the symmetric hyperbolic cases, what are the sets of admissible t a giving rise to a hyperbolization 
and what are the associated propagation cones? The answers to these questions are provided by 
the following theorems 
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Theorem 1 . The system is symmetric hyperbolic iff the effective cones (co-cones) have a non¬ 
empty intersection, i.e. C 01 n C S2 / 0 (C* x n C* 2 / $). This will be the case whenever 


ai& = 1 + ^ ( ( FN i + GN i) ~ \/ N i + N 2 VF 2 + G 2 ^j > 0 


(39) 


The strategy behind our proof is as follows: write H a p(t, n ) in a preferred representation and build 
a mixed matrix T°f (by means of an auxiliary Riemannian inner product p a p of M 6 ), such that: 

H a 0 = Pa-jT 1 ^. (40) 

In particular, we choose p a p = 5 a p, with 8 a p the six-dimensional Kronecker delta. We then look 
for the positiveness of all the eigenvalues of T“ 7 which is essentialy equivalent to the positive 
definiteness of It turns out that the above requirement imposes a restriction on the allowed 

values of the background invariants (and on the theories, through f t ): such restriction is precisely 


(39), and it is equivalent to the statement that the effective cones do intersect each other. This 


result reveals an interesting geometrical aspect of NLED regarding hyperbolicity. Also, it provides 
a simple diagnostic tool for computing whether such geometrical property holds in a particular 


situation. Note that (39) is a local condition which contains information on the particular theory, 


but also depends on the background solution and space-time point one is looking at. Thus, it 
might be possible to have theories which are symmetric hyperbolic only for a reduced subset of 
field configurations. 


Theorem 2. A theory satisfying (39) admits a collection of symmetrizers parametrized by vectors 


t a such that t a € C Bl n C 02 = C S2 . Regardless of the particular choice, the resulting propagation 


cone Ch is given by the closure of the union of the effective cones, i.e: Ch = C 01 U C 02 = C, 


-'0i 


The starting point underlying the second proof is the construction of an explicit pair (t“,n") 
satisfying the positivity condition. We then show this particular couple necessarily lies in the 
intersections of the cones (co-cones) engendered by the effective metrics. Then, by looking at the 

10 The reason to introduce p a p is simply because it only makes sense to talk about eigenvalues when the operator 
acts from one vector space to itself. It might sound as a rather technical subtlety, but is not, since there is not 
unique (or natural) metric to raise the index here. There is an arbitrariness involved on the inner product and the 
resulting eigenvalues are not going to be covariantly definded in general; one is implicitly introducing a coordinate 
dependence on the election of a particular p a p- Nevertheless, the positive character of the eigenvalues is in fact 
invariant and it will guarantee the positive definiteness of H af 3, as long as p a p is an inner product. 
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determinant of T“ 7 , we will argue how much we can extend the vector £“ and the covector n° 
without loosing the positive character of the eigenvalues. It turns out, the limits for extending 
them are just the closures of the above-mentioned intersections. Finally, once one finds all possible 
covectors n a , a simple computation yields the propagation cone Ch, which we find to be Ch = 
C gi U C g2 and, as a consequence of proposition [ 2 J there follows Ch = C gi . 

IV. PROOF OF THE MAIN THEOREMS 

To begin with, note that H a p(t, n ) is linear w.r.t. t a and n a . Thus, once a pair (£“, n°) satisfying 
Hap(t 0 ,n 0 ) > 0 is found, then (by keeping fixed one of them, say £“), it will exist a neighborhood 
(around the other, n°) for which H a » is still positive definite. These neighborhoods necessarily 
define open and convex sets which characterize them as cones according to our definition. 


A. Non-degenerate F a b 


We start by writing H a p(t,n) in a convenient representation. Our choice is irrelevant from the 
conceptual perspective, but important from the operational point of view. We rely on the same 
frame used in section II-C and consider as a basis for M 6 all anti-symmetrized pairs of the covectors 
T a ,x a ,y a ,z a (see the appendix A). A well established algebraic result [55], states that the positive 
definiteness of H a p(t,n) is equivalent to the positivity of all eigenvalues of a matrix T°p, related 


via (40), for any Riemannian inner product p a p. 

Thus, we seek for a specific pair (£“,n°), rendering all eigenvalues positive. Unfortunately, we 
were not able to explicitly calculate them for the most general pair. However, assuming that £“ and 
n° lie within the plane defined by the principal null directions, calculations simplify considerably. 
In the chosen frame this hypothesis is equivalent to, 


to = {to, 0,0, £ 3 ) ! K = (n 0 , 0, 0, n 3 ) 


For such pairs, the six eigenvalues A,; were computed using Mathematica, and are given by, 


Ai, 2 = (no - n 3 ) (£0 + £ 3 ) 
A 3 ,4 = (n 0 + n 3 ) (t 0 - £ 3 ) 
A5 = (n-oto + n 3 £ 3 ) a\a2 
A 6 = (n 0 £o + n 3 £ 3 ) /?i/ 3 2 
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Consequently, A 5 and Xq will be positive if sign(aia. 2 ) = sign(/3 1 /^ 2 )- Equivalently, we have 

0 < ai<a;2/3i/32 = H 1 H 2 

These relations imply the following: i) signify 1 ) = signiflf) which, as we have seen, means that 
the two cones have a non-empty intersection; ii) recalling that aifo < 1 < « 2 / 5 i (from lemma 1 in 


section ITC), one has ai /?2 > 0, which is precisely expression (39) from theorem 1. Under these 
assumptions, there are only two possible cases to analyze: fl* > 0 , which corresponds to figure la 
(section ITC); < 0 , as illustrated in figure [TcJ These two cases are considered separately below, 

1. If Qi > 0, we choose = (1, 0,0, 0) and n° a = (1,0,0,0). And we get, 


'1,2, 3,4 


= 1 


A 5 — aqa :2 >0 ; Ag — j3 1/?2 > 0 


2. If < 0, we choose £“ = (0, 0, 0,1) and n° = (0, 0, 0,1). Obtaining, (once the symmetrizer 
is multiplied by (— 1 )), 

Ai,2,3,4 = 1 ; A 5 = — aqa :2 >0 ; Ag = > 0 


Thus, we conclude that (39) is a sufficient condition for the system to be symmetric hyperbolic. 


To prove it is also necessary , we still need to justify the restriction of (£“, n°) to those lying on the 
PND’s plane. We will return briefly to complete this part of the proof. For the time being, we now 
concentrate on the question regarding how far the neighborhoods of (£“, and n°) can be extended, 
while still preserving the positivity condition. We already discard the space-like part (with respect 
to the effective metrics) of the null plane, for in this case some eigenvalues become negative. 

The determinant of T°p was calculated using Mathematica and is given by, 

„2 j-CL+b^ 


det (T°p) = Oi0 2 (stn a n b ') ( n a t a f (dl b t a t b ^j (&lbt a t l 

= A1A2...A6 


(41) 


for generic vector t a and covector n a ■ This result is at the base of the following discussions. 

First, notice that the vector £“ (and covector n°) we find above, belongs to the intersection of 
the two cones (co-cones). In other words, t“ 6 C Sl D C g2 and n° € C'f D C* 2 [^J Now, whenever an 
eigenvalue becomes zero, one of the following must hold: (a) n a is a null covector of either or 
gf 6 ; (b) t a is a null vector of either g^ fe or g^ b ; (c) t a n a = 0 . 


11 


In our frames, this can be seen in a rather direct way for each of the two situations, namely, fl; > 0 and fl; < 0 . 
That is, why we believe it is not necessary to further justify this statement. 
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Proposition 4. If t a E C gi D C S2 and n a E C* x n C* 2 , then t a n a > 0. 

Proof. We rely on previous results from section II-C. In particular, proposition [2j which states: (i) 
C'fli C C 02 and (ii) C 02 C C gi . Thus, 


C ai H Cg 2 — Cg2 


C.! n ^ 


It is straightforward to see now that any covector n a E C 01 C C 02 will satisfy (recalling the 
definition of dual cone) that t a n a > 0, for any vector t a E C 02 . □ 


From this result we conclude that the set of all the t a for which the symmetrizer is positive 
is just C gi FI C s 2 = (7 02 ; and similarly, that the set of all n a is given by (7 01 n C 02 = C 01 . As a 
corollary, and according to Geroch’s definition of a propagation cone (we gave in section III, def. 
2 ), we find: 


Ch — C gi — C gi U C S2 (42) 

This concludes the proof of theorem 2 for a non-degenerate F a b. To complete the demonstration 
of THM 1 we proceed by contradiction. 


To show (39) is also a necessary condition for (symmetric) hyperbolicity, we assume C' 0i nC 02 = 0 
and that there exists a pair (t“,n°) satisfying: A* > 0 , Vi = 1,2, .., 6 . This will lead us to a 
contradiction. Indeed, if (or n° a ) lies outside both cones (co-cones), then the resulting set of 


allowed vectors (covectors) acoording to (41) does not configure a convex cone. In fact, Eq. (41) 
directly implies that, if such a vector (covector) existed one would be able to keep the positivity 
of the symmetrizer by reflecting the corresponding vector through the preferred plane. However, 
it turns out that their sum would fall into a part of the null plane in which the system is not 
hyperbolic. Thus, the set of vectors wouldn’t characterize a cone according to our definition, which 
is a contradiction. Therefore, both i“ and n° must belong to one of their respective cones/co- 
cones. The contradiction came from assuming one could find a pair (t“,n°) satisfying: Aj > 


0 , Vz = 1,2, ..,6 for the cases in which the cones do not intersect. Thus, (39) is also a necessary 


condition which finishes the proof of THM 1 (for the non-degenerate cases). 
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B. Degenerate F a t 

In order to complete our proofs, we consider here the cases where the background field is 
degenerate. It turns out that these cases are much simpler, since (as we have already seen) there 
is always a non-empty intersection of the two cones. Thus, it only remains to be proven that there 
exist a common timelike vector and covector < (respect to both effective metrics) for which the 
six eigenvalues A, of T c g are positive. We can do this explicitly, using the results from section II-C. 
Recall that a common time-like vector and covector were found, which in our particular frame 
reads 


£ = ( 1 , 0 , 0 , 0 ) ; < = ( 1 , 0 , 0 , 0 ) 

Computing the eigenvalues for this pair one obtains, 

Ai = 1 + e 4 |i>i| > 0 

A 2 = 1 - £ 4 |&i| >0 
A3 = 1 + e 4 1621 > 0 
A 4 = 1 — e 4 I& 2 I >0 
A5 = -^6 = 1 


Then, for the second part of the proof, we follow the same lines as for the non-degenerate case. 
Notice that expressions (41) and proposition [4] also apply in the degenerate case. In addition, we 
have provided (in section II-C) with the analog of proposition [2j namely, proposition [3j Therefore, 
everything follows identically as before and as such concludes the proofs of theorems 1 and 2 . 


V. CONSTRAINTS 

The symmetrizer h A a may be understood as map from the space of equations (indexed by “A ”) 
to the space of unknowns (indexed by “a”). In other words, it selects from the entire set of first 
order equations some combinations of equations which we can evolve along some direction which we 
usually relate to time. What are the remaining equations the symmetrizer does not capture? For 
the system to be consistent they should not be of the evolution type, for in that case they are either 
linear combination of the ones already selected by the symmetrizer or they would be incompatible 
with the previously chosen evolution. In other words, they must be satisfied automatically once 
they are satisfied initially i.e. they should be what we normally call the constraints. In Geroch’s 
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formalism a constraint is a tensor c An such that 


c Mn Kl n) a = 0 


(43) 


When this formalism is applied to the equations of nonlinear electrodynamics we obtain a linear 


space of constraints characterized by vectors (x, y ) in M 2 of the form c An = ( xg an , yg an ). To check 


that this c An does indeed satisfy (43), we combine it with the principal symbol to obtain 
1 


An 


k a « = -x \ 2 g 


nmbc 


+ F nm {^F bc + £ 2 F bc )+ F 


bc\ 


^F bc + ^F bc )} + -yr, 


1 


nmbc 


(44) 


which is anti-symmetric in the quantities n and m. The constraints are complete in the sense that 
the dimension of evolution equations provided by the symmetrizer plus the dimension of constraint 
equations give the correct number of PDEs. Note that, contrarily to KJ 71 , the tensor c An does 
not depend on the electromagnetic field F a b and coincides with those of the linear theory. We now 
show that this two-dimensional system is integrable i.e. that it satisfies 


Vn(c 


,An 


K A m a V m <S> a ) = V n (c 


.An 


K A m a)Vm<S> a + V n V m ( C ^K A m Q T“) = 0. 


(45) 


identically. The second-derivative term drops out, as a consequence of (43), and so we are left with 
an algebraic equation in the first derivatives, of the form. 


d 


(■ c An K A m a ) = 0. 


(46) 


This is an integrability condition for the equations as a whole. If it holds as a trivial algebraic 
consequence of the equations of motion we say that our constraint is integrable. To show that this 
is indeed the case for any NLE derived from a Lagrangian it is convenient to rewrite the equations 
of motion in the form 


w m (kJTA a ) = o 


with I\ , 


t A a -A 9 a mbc F Fg y a rnbc , y a mbc )• Multiplying by c An V n and noting that c 

commutes with the derivatives we obtain for the left-hand side 


(47) 

,An 


\7 n \7 m (c An K A m a <S> a ). 


(48) 


Now, because c An K A n is also anti-symmetric in n and m, this quantity is identically zero. Thus, 


equation (46) holds trivially and the constraints are therefore integrable, i.e. they remain true 
on the whole domain of dependence provided they were so at initial time. When the constraints 
are integrable, by studying the properties of the compound hyperbolic PDE system it follows 
that all the well-posedness results for symmetric hyperbolic systems also apply in the presence of 
constraints. 
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VI. EXAMPLES 


In this section we discuss several examples illustrating the power of the results discussed so far. 
The latter serve as simple diagnostic tools in testing whether well-posedness (hyperbolicity) holds 
and what are the associated physical propagation cones. In particular, if ctifa > 0, theorem (jTj) 
guarantees a non-trivial intersection between the effective cones and so, hyperbolicity. When this 
condition is fulfilled we shall check whether these cones intersect the background cone or not and 
whether the propagation speeds (given by C 01 ) are sub or super-luminal. 

We will perform this analysis based on 71 for non-degenerate cases and 61 for degenerate ones. 
As can be seen from propositions ([Tj) and ([2]), when 71 < 1, then C v (or C a ) C ( 7 gi ; when 71 = 1, 
then C gi = C v (or C a ) the effective metric becomes in the background (of a rotation of it) ; finally 
in the sub-luminal case 1 < 71 then C gi C C v or (C a ). Notice that since all the metrics share 
at least one null direction along that direction all propagation speeds coincide, so we always have 
some directions with speed of light propagation. 

For degenerate cases, the causal possibilities are giving by 61, as it can be seen from proposition 
Q if b{ < (1 then C Sl C C v ] if b\ = 0 then C v = C gi ; if 0 < 61 then C v C (7 gi 

In the next examples we shall use a frame in which E || H (A3), in that case, _ ^2 


and VW+&+F = H 2 


A. Born-Infeld 

The Born-Infeld theory is the paramount example of non-linear electrodynamics. It was pro¬ 
posed to remove the divergence of the electrons self energy at the classical level. The idea was to 
use a non-linear generalization to Maxwell's theory, which deviates from it at very strong fields. 
It naturally introduces a cut off /3 limiting of maximum electric fields around a static charge, thus 
avoiding the singularity at r —> 0. The Lagrangian expression is, 

11 = 1,1 (\j 1 + W~Tk~ a + 1 ) 

One interesting property of this theory is the absence of birefringence, since N\ = N 2 = 0, m 
the two effective metrics are identical and so are the propagation speeds of all physical modes. In 
this case the cones intersection is obvious, and the systems is symmetric hyperbolic. In addition 
the propagation cone is contained in the light cone of the space-time metric, so propagation speeds 
are lower or equal to the speed of light. 
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• Non-degenerate case F 2 + G 2 / 0 

Before starting the hyperbolicity analysis it is important to note that when f3 2 —> '/ f2 +g 2 ^ f , 
|p —> oo and if f3 2 < — F ' 2 ^ 4 G — then C became complex. Thus, we shall restrict attention 
to the range = e 2 < /3 2 , which is consistent with the original idea of a limited 

electric field strength in the Born-Infeld theory. 

The effective metrics are conformally related, g% b = ( ~ G ^F+i^ 2 ^ ) an( ^ ^ easy to check 
that 0 < ai/?2 = 1 (and 0 < aia2/3i/32 = 1) so, as we have said, the theory is symmetric 
hyperbolic. 

( 2B 2 -\-fS 2 

In addition ^ = 4 16 ^ 2+#2 ) ^2 _ E 2 ) > 0 thus when E 2 < /3 2 , the effective metrics have 
cone intersections with the background metric. 

We compute now 



and conclude that 1 < yf i.e. it is a sub-luminal case, only propagation speeds up light are 
allowed. 

In the limit of weak fields, E 2 , H 2 « /3 2 , 71 —> 1, so, as expected, the theory becomes closer 
to Maxwell’s. 


• Degenerate case F 2 + G 2 = 0 


As we have established, 
intersection. In this case, 
are lower than light. 


degenerate cases are always hyperbolic, we can check the cone 
using (25), gf b = g% b , with 2 = — J? < 0 the propagation speeds 


B. Toy Model 1 

The present example, like the Born-Infeld one, has only one effective metric (Ad = N 2 = 0), 
so it is symmetric hyperbolic for any value of the fields, but unlike the former the effective metric 
cone never intersects the background metric cone (see figure [lc]). Its Lagrangian is given by, 


C = 


F 

G 
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The theory is not defined for G = 0 since £ 2 ,^ 3 ,a,Q, and R blow up in this limit, so the 
degenerate case will not be discussed. 


The effective metric definitions (12) we are using also go bad at F = 0, but they can be 
re-scaled (see footnote [ 2 ]) so that they become finite. We can use that new effective metrics in 
place of the former without affecting the results of this paper. We obtain, g^ b = {—^)g\ b so 
they are conformally related, and therefore share the same cone implying the system is symmetric 
hyperbolic. In addition Hi = i = —H 2 E 2 < 0, so the effective metrics cone have no intersection 
with the background metrics cone. This implies that initial data must be given in a tinre-like hyper¬ 
surface with respect the background metric, but space-like for the effective metrics, evolution occurs 
in temporal directions with respect to the last one. 


C. Electrodynamics from Kaluza-Klein theory 


We shall analyze three examples introduced in m • The authors start from the Kaluza Klein 
metrics in 5 = d + 1 dimensions, and adding a Gauss-Bonet terms (that in five dimension is not a 
topological invariant) to the Einstein Hilbert action. They arrive to an effective electrodynamics 
theory in d = 3 + 1 dimensions. The resulting Lagrangian is 

C = -\ F + ^[(b- 1) F 2 - ^G 2 ) (49) 

with 7 a perturbation parameter from the Lagrangian, associated with Gauss-Bonet terms, it will 
be interpreted as a function of physical quantities ( e,m e ,h,c ) and 6 a parameter associated to a 
term in the action quadratic in Ricci scalar. This last parameter is chosen in order to avoid ghost 
propagation, and will give rise to very different electromagnetic theories. 


The Symmetric Hyperbolic condition is 


aiQ!2/3i/32 


(CF (1 - 46) 7 + 4 ) 2 - (5 7 - 267) 2 (F 2 + G 2 )) 
( 2 E 7 — 2 E 67 + 4) ? 


We shall check under which conditions this is fulfilled in the following examples. 


1. Gauss-Bonnet electrodynamics 

For this theory 6=1 and it is interpreted as the first order string theory corrections to the 
general relativity [58J 
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The Symmetric hyperbolic condition becomes 

aia 2 /Si /^2 = (l — 3 H 2 ^) (l + 3~/E 2 ) > 0 (50) 


Because of the sign( 7 ) is not defined, and this give rise to different hyperbolicity conditions 
and different effective metrics, we need to study each case in particular. 


If 7 > 0 condition (50) implies H 2 < 


37 


— Non degenerate case 

The effective metrics have cone intersections with background metric ones because 

Hi = (1 - 3H 2 ^) (1 + 37^ 2 ) > 0 


In addition, propagation faster than light are allowed, this is so even for the degenerate 
case, as we will check 


1 > 71 


1 - 3H 2 '-/ 
1 + 37 E 2 


> 0 


(51) 


— Degenerate case 


!>i = 37 > 0 


If 7 < 0 the dominant energy condition is satisfied and condition (50) implies E 2 < 7 ^ in 


concordance to Gibbons and Herdeiro. 

The effective metric 0 i is conformal to the background metric, so they have the same cone 


— Non degenerate case: gf b = (1 — 3 ^H 2 + 3^E 2 )g ab 

— Degenerate case: gf b = r] ab 


2. Born-Infeld to second order 


For b = — 1 and 7 oc /? , equation (49) approximate the BI Lagrangian to second order, so we 
recover birefringence. 


• Non degenerate case 


The condition for hyperbolicity is, 


atia2/3iP2 


16(1 + 3 7FI 2 ) (I-37F; 2 ) 
((l + 37 tf 2 ) + (l- 3 7 £ 2 )) 2 
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Then you can prove the last expression is positive when 

i (i+ 37 if 2 ) 

3 (1- 3yE 2 ) 


This implies non empty cones intersection of background with effective metrics 

( ( 1+3 -yH>)\ 

_ ( 37 H 2 + 3 "fE 2 + 2 )“ \ 6 ( 1 - 37 &) ) ^ 

= (3-,m - 3 7 -E 2 + 2) 2 (l + ii±ggl) > 0 


Moreover 


71 


o (l+37-ff 2 ) 

6 (I-37 E 2 ) 

1 1 (1+37 m 
1 (1-37 E 2 ) 


In the symmetric hyperbolic range, 


If 0 < 7 => 1 < 


(l+37^ 2 ) 
(l— 37 -E 2 ) 


< 3 =^ 0 < 7 i <1 and super-luminal propagation is allowed 


If 7 = 0 => the system reduce to Maxwell theory and propagations coincide. 


If 7 < 0 =7 l < 


(1+37-ff 2 ) 

(1-37E 2 ) 


< 1 => 1 < 71 and propagation up to speed of light are allowed 


• Degenerate case 


h = 37 

If 7 > 0 then b\ > 0 and propagation’s faster than light are allowed, 

If 7 = 0 then b\ = 0 and we recover Maxwell theory 

If 7 < 0 then 61 < 0 and propagation’s up to speed of light are allowed (propagation for the 

unique null direction of F a b will be to speed of light) 


3. Euler-Heisenberg 


For b = | and 7 oc a the fine structure constant, the Euler-Heisenberg theory becomes the 
effective Lagrangian for QED due to one loop corrections [3] 


• Non degenerate case 

( _ ^ - 3~,E 2 _y /6 (7 + 37-+) t / (7 + 3jH 2 ) 

V - 6(7 + 37// 2 )—6(7 — 3yE 2 ) + 63/ I 5 (7 - 3-/E 2 ) I { (7 - 3-/E 2 ) 6 J 


011 CX 2 P 1 P 2 = 270 
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Then the systems is symmetric hyperbolic when 

5 ^ (7 + 3y H 2 ) ^ 6 

6 < (7 - 3 7 £ 2 ) < 5 

The analysis of the propagation velocity in this case will not be studied due to its difficulty. 


• Degenerate case 


bi 


33 

14 


7 + 


9 

14 


ItI 


If 7 > 0 then b\ > 0 and propagation’s higher than light are allowed 
If 7 = 0 we recovery Maxwell theory 

If 7 < 0 then b\ < 0 and propagation up to speed of light are allowed. 


D. Euler-Heisenberg II 


We present another approach given in [59] , it is a strong field approximation for the Euler- 
Heisenberg action, with the following Lagrangian 

L = --kF\FG\* 

4 

with k = E ~ 25 , a critical field and 5 = ^ ^ 4 nhce Q ) f° r spinors QED and 5,g = f for scalar QED. 
The degenerate case are not well define because some quantities blow up. 

For the non degenerate case, we define 

- (65 4 + 44<5 3 + 92<5 2 + 645 + 16) ± yjl6 (5 + 2 ) 3 (25 + l ) 2 (65 2 + 55 + 2) 

V± = 2 (165 + 285 2 — 95 4 ) ' 

Thus the system is symmetric hyperbolic when 
If (l65 + 285 2 - 95 4 ) >0 

p2 p2 

Q2<y- ° y+ < q2 


If (165 + 285 2 - 95 4 ) < 0 

F 2 

y- < Q2 <y+ 
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E. Toy model 2 

We consider an arbitrary Lagrangian as function only of F, 

C = £ (F). 

Then 6 = £3 = 0 and 61 := (£1 + |Ci |)M = (6 - |Ci|) • 

• Non degenerate case 

We will see that the hyperbolicity condition is 

1 ^ ^ 1 
2H 2 < ^ 1< 2 E 2 


1. If Ci > 0 


The systems is symmetric hyperbolic if 0 < ai /?2 = 1 — 2 £i E 2 then 


0 < Ci < 


1 

2E 2 


In addition 

1 - 2& E 2 
71 “ 1 + 2Ci H 2 

If Ci = 0 (and 71 = 1) the theory behaves as Maxwell, and the birefringence effect disappear, 
the propagation is given for the background cone. 

If 0 < Cl < then 71 < 1 and propagation faster than speed of light are allowed. 
Moreover 


Qi = (1 — 2Ci E 2 ) (l + 2CiF 2 ) >0 

then the effective metric have cone intersection with background metric. 


2. If Ci < 0 

The systems is symmetric hyperbolic if 0 < ai /?2 = 1 + 2Ci H 2 


1 


2 H 2 


< Ci < 0, 


and cone propagation is given for background cone, 


71 = 1 Hi = 1 . 
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• Degenerate case 

If £1 >0 then b\ = 2£i > 0 and propagation faster than light are allowed 
If Cl < 0 then b\ = 0 the background metrics is the causal cone. 

The particular case C = with ^ = ^7 is similar to the previous Gauss-Bonnet theory. The 
hyperbolicity range coincide under the change E —>• H and H —>• E. 

VII. CONCLUSIONS 


Non-linear generalization of Maxwell’s theory arise naturally in many circumstances, some of 
them were discussed in the examples presented above. In most cases of interest they are generated 
from a variational principle involving a Lagrangian function of the two Lorentz invariant scalars 
one can form from Maxwell’s tensor. It is well known that all these theories have a dispersion 
relation (set of hyper-planes allowing solutions which are constant along them) determined by the 
null vectors of two effective conformal class of Lorentzian metrics, which we call g“ b and g^. These 
cones share a very important property, generically they have a pair of null directions in common 
with the background metric, thus a preferred two plane, containing these two vectors is preferred 
too (in some degenerate (non-generic) cases these two directions collapse into a single one). Being 
null cones of conformal Lorentzian metrics we can define their propagation cones, namely the cone 
of all vectors whose norms with respect to each one of these metrics are positive definite. The 
shared null direction property implies that either the propagation cones are nested, one inside the 
other, or they are not, having no vector in common. 

A first question one might ask when one is face with any one of such generalization it is whether 
they are well posed, namely whether the solutions are continuous functions of their initial data. 
Without this requirement the theories are powerless, they don’t have any predictive power. Using 
the covariant approach to symmetric hyperbolicity introduced by Geroch we developed simple 
criteria for non-linear Electrodynamics theories arising from arbitrary Lagrangians to shield well 
posed system of evolution equations. 

The criteria we found are of two kind, and of course equivalent among each other. One of them 
is geometric in nature, it says that whenever the propagation cones of the two conformal effective 
metrics have a non-empty intersection the theories are symmetric hyperbolic. There are here two 
cases, in one of them the conformal metrics propagation cones have also intersection with the that 
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of the background relativistic metric, in the other not. As can be seen from the examples, Born- 
Infeld, Gauss-Bonnet, Euler-Heisenberg and toy model II have cone intersection with background 
metric, and only the toy model I doesn’t have. In this second case the propagation has the 
particularity that the propagation is along space-like directions with respect to the background 
metric, nevertheless, taken the theory in its own, it is perfectly causal and has an initial value 
formulation in which data must be given along a time-like (with respect to the background metric) 
hyper-surface. In addition, in both cases one of the propagation cones is always inside the other 
so causality is always given by the cone of the metric with the largest propagation cone, which 
according to our definitions is always g^. There are cases where this propagation cone strictly 
contains the background metric cone and so we have propagation speeds larger than ’’light” namely 
larger than those allowed at the background propagation cone. In other cases the smaller cone is 
strictly inside the background cone and we have relativistic causality. 

The other criteria are algebraic, the cones non-empty intersection is equivalent to the positivity 
of the scalar expression aify which can be explicitly computed from the Lagrangian function and 
its derivatives and depends only on the values of the two Lorentz invariant quantities. Whether 
the biggest of the metrics propagation cones is contained or contains the background metric cone 
depends on whether the quantity 71 is bigger or smaller that one for non degenerate cases, and 
on b\ is negative or positive for degenerate cases. The algebraic criteria are very easy to check 
on actual examples. It is important to realize that whether a theory is hyperbolic or not might 
depend on the field strength of it, (the values the invariant scalars take), and so along evolution 
a perfectly nice solution might cease to be well posed. Using the algebraic conditions one could 
characterize the set of theories where hyperbolicity holds for all values of the field. That set of 
theories should be preferred. 

The proof of the above mentioned criteria involves the explicit construction of hyperbolizers, 
thus, when trying to evolve the equations of these theories we provide a set of evolution equations 
which can be safely used. The propagation cones of these symmetrizers coincide with the smaller 
one of the effective metrics and so we recuperate for them the same causality properties as discussed 
above. Thus these hyperbolizations are optimal, they allow for all possible hyper-surfaces where 
initial data can be given. 

We also assert that the constraints arising in the theories are in all cases integrable (in the 
sense of Geroch). In some sense this is so because the constraint structure is close to the one 
of Maxwell. This means that in all cases, if they are satisfied for initial data, and the system is 
hyperbolic, they remain satisfied along evolution as long as the system remains hyperbolic and 
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inside the corresponding domain of dependence. 

Notice that at no point we use energy conditions this is so because hyperbolicity depends on 
second derivative conditions of the Lagrangian, while energy conditions involve only first deriva¬ 
tives. To some extent energy conditions are related (through energy momentum conservation) to 
causality conditions. Their imposition probably would prevent those cases where the effective cones 
have no intersection with the space-time cone. But in those cases one probably would require a 
different energy condition. 


Appendix A: Frames 

In order to facilitate calculations we work in a family of preferred frames, to built it we use the 
principal null directions of F a b- To display it we use spinors |53j-{55], but this is not essential. 

We know that in the spinorial form the electromagnetic held is 

Fab ~> £Ab 4>A'B' + tA’B'^AB (Al) 

with 4> ab = (t>(AB) and e AB = 

1. Non-degenerate case 

In the non-degenerate case, <ft>AB is characterized by two different principal null directions, 

(ftAB = (ft d(A^B) 

where 9aI^ A = 1 and (ft is a complex normalization factor. 

Using these spinors we can construct a (complex) null tetrad, 

k a -A 6 a 0 a ' l a —> ^aUa 1 -A Oa^a 1 Fi a -A 9 aFa (A2) 

where = 1) rn a r] a biri a = — 1 and all other contractions vanish. 

We observe they are eigenvectors of the electromagnetic held, namely: 

F a b k b = Re (0) k a F a b l b = —Re (</>) l a F a b m b = ilm (4>) m a F a b m b = —ilm (</>) fh a 
So by noticing that F = —((ft 2 + (ft 2 ) and G = i(<ft 2 — 0 2 ), it follows that 
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Notice G = Re (<f>) Im ((f) ), and so the signs of Re (<f> ) and Im ((f) ) must be properly fixed according 
to the sign of G. Finally, we build our (real) tetrad as 

r a = -4= (k a + l a ) z a = 4 ? (k a - l a ) 

^ V ' (A3) 

x a _ (j77, a 4- fri a ) y a = (?n a — m a ) 

In this frame the background metric is rj a b = Diag (1,-1,—1,-1); r a is time-like while x a ,y a , z a 
are space-like vectors. The electromagnetic field and its dual read, 


F ■ — 



1 0 - 

-Ei 

-e 2 

-e 3 ^ 

Ei 

0 

h 3 

-h 2 

E 2 - 

-H 3 

0 

Hi 

\ e 3 

Hi 

-Hi 

0 ) 

( 0 

Hi 

H 2 

h 3 

-Hi 

0 

e 3 

— E‘2 

-h 2 

—E 3 

0 

Ei 

\-h 3 

e 2 

-Ei 

0 


( 0 0 0 - 

0 0 Im (4>) 

0 —Im((f>) 0 

^ Re ((f>) 0 0 

/ 0 0 0 
0 0 Re ((f>) 

0 —Re (cf>) 0 

^ —Im ((f)) 0 0 


Re ((f)) ^ 

0 

0 

0 

Im ((f)) ^ 

0 

0 

0 ) 


(A4) 


where it follows that E and H are parallel, both lying on the z a direction. And one can immediately 
see how calculations will become simpler by adopting this frame. 


a. Boost freedom 


Since the extend of the two principal null directions (or corresponding spinors) is arbitrary, 
there is a freedom on the frame choice which does not alter the above form of F a b. If one considers 
the transformation 6 a —> and y A -A ey A , it can be notice that 4>ab, and therefore F a b , remains 


unchanged. Thus, if we choose e real, the null tetrad (A2) changes to 


k a -A 6 a@a' l a —> £ 2 FaFa' fn a —> 6aFa' m > Qa'FA (A5) 

and in the new frame 

= -A (k a + [°) z a = ^ (k a ~ l a ) 

x a = ^ (m a + m a ) y a = -(m a — fh a ) 

So this freedom corresponds to a boost in the null plane defined by the two null directions. The 
choice of e imaginary causes a rotation of the frame components perpendicular to the null plane 
form by the two null directions. Both, the metric r] a b and Maxwell tensor components remain 
invariant. 
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2. Degenerated case 


As before, the electromagnetic tensor takes the spinorial form (Al), but now there is just a 
single null direction associated to it: 

4 >ab = OaOb (A6) 

To complete the frame then, we shall choose an arbitrary null direction ha, such that 9aH A = 1 


and we proceed building a null tetrad like in (A5): 
k a 


^9a0a' l a £ HAHA’ mCL 9 aHa' m 9 A'Ha 


where we set k a r] ab l b = 1, m a "q a f J m a = — 1, and in analogy to the previous case any other contraction 
vanish. With e being a real positive parameter that we can freely pick up 12 
It follows that, 

F ab k b = 0 ; F ah l b = -e 2 (m a + m a ) ; F ab m b = -e 2 k a ; F ab m b = -e 2 k a 


Now we build the frames as in (A3), and get the following expressions for F ab and F* b 

( 0 -£ 2 0 0 ^ 


Fij — 


]?*. — 


' 0 - 

-Ei 

~e 2 

-e 3 

Ei 

0 

h 3 

-III 

E 2 - 

-H 3 

0 

Hi 

\ e 3 

h 2 

-Hi 

0 ) 

( 0 

Hi 

h 2 

h 3 

-Hi 

0 

e 3 

—E 2 

-h 2 

—E 3 

0 

Ei 

\-h 3 

e 2 

-Ei 

0 


e 2 0 0 —£ 2 


0 0 0 0 

y 0 e 2 0 0 J 

( o o £ 2 o N 

0 0 0 0 


—£ 2 0 0 £ 2 


\ 0 0 -£ 2 0 

Thus the vectors E = (£ 2 ,0,0) and H = (0, £ 2 ,0) are orthogonal each other (i.e: G = 0) and with 
equal norms (i.e: F = 0), as they should. 


Appendix B: Degenerate effective metrics 

In this appendix we want to show that if one of the two effective metrics becomes degenerate 
(non-invertible), then the system is not symmetric hyperbolic. Such degenerate cases will occur 


Notice we strongly rely on this freedom in Section II-C.2, particularly on proving lemma |2j 
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whenever one of the variables in (16) becomes zero. From equation (18) we see that only the cases 
= 0 or /?2 = 0 are allowed. When a\ = 0, say, the metric will no longer be invertible. 


However, we stress here that the determinant from expression (41) still factorizes into four metrics 
S 2 6 ) Safe’ flab)i where now Ql b t a t b = /3i (to — t|) is obviously not the inverse of 0 “ b . When /?2 = 0, 
the determinant factorizes into four metrics, ( 0 f b , 02 6 , 0 „ 6 , 0 ^ b ), where Q 2 b t a t b = —«2 (tf + 1 |) 




(a) c*i = 0 and a 2 ,Pi,p 2 ^ 0 


(b) = 0 and ai,a 2 ,/h 7 ^ 0 


FIG. 3: Null surfaces of the metrics are illustrated: 01 ab (red); 02 ab (blue); 7j ab (grey). 


We first analyze in detail the case a\ = 0. Suppose for contradiction there exist n° a and 
t b Q such that the matrix T°^(n°,t 0 ) has all its eigenvalues positive, and let us generically write 
n° = (no, ni, 77 . 2 , ^, 3 ). Then, by following similar arguments to those used on the proofs of the 
main theorems, we will reach a contradiction. The construction goes as follows: we first show it 
is possible to continuously connect n° a with a second covector, h° = (no, — ni, — ri 2 , n 3 ) without 
changing along the path the sign of the eigenvalues. The path we propose is, 

(n 0 ,ni,n 2 ,n 3 ) -)• (no, em, en 2 , n 3 ) -)• (n 0 , em, 0 , n 3 ) ->■ (n 0 , eni, -en 2 , n 3 ) -> (n 0 , 0 , -en 2 , n 3 ) 

—> (no, — eni, — en 2 , n 3 ) —> (no, —ni, —n 2 , n 3 ), for some positive but small parameter e. 

Then, as argued at the beginning of section IV, since both H a p (n°, t a ) and H a p (h°, t a ) will be 
positive dehnite, it turns that H a p (n° + h°,t 0 ) must be be positive definite as well. But, a simple 
computation shows n° + h° = 2 (no, 0 , 0 , n 3 ), which should also be in the cone, is null for 0 “ b , and 
so one of the eigenvalues must be zero,leading us to a contradiction. Hence, to conclude no such 
pair (n°, t b ) can exist. 

The exact same construction (but now with t a ) can be applied when P 2 = 0 where one of 
the effective metrics in their covariant version degenerates. Thus, the system is not symmetric 
hyperbolic for any of these two ’’pathological” cases. However the last case might still be strongly 
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hyperbolic, we shall study this case in a future work. 
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